Changes on the Horizon for the Multimedia Community
نویسنده
چکیده
The Impact of Deep Learning The development of AI algorithms, represented by deep learning, has bolstered multimedia research. In particular, deep learning has led to a multimodality-based algorithm framework, enabling the effective fusion and use of cross-domain data. Take image and video captioning, for example. A couple of years ago, tagging was the only way to describe images and videos. But now, deep learning has helped establish the connection between computer vision and natural-language processing (NLP), resulting in tagging being replaced by coherent, naturallanguage sentence generation. Image captioning took inspiration from recent advances in machine translation. The captioning process is based on a Convolutional Neural Network (CNN) that encodes an image into a compact representation. Then, a Recurrent Neural Network (RNN)/Long Short Term Memory (LSTM) generates the corresponding sentence. The latest research has also introduced interdisciplinary theories and methodology into video captioning. Unlike image captioning, both spatial details in a single frame and temporal information between frames play an important role in video captioning. Consequently, it is usually more complex than image captioning. As related fields and hardware continue to develop, the generation of multiple sentences or even entire paragraphs could become a reality for image and video captioning. Moreover, more natural user interaction systems could become available, and modalities might not be limited to computer vision and NLP. Instead, voice, depth, and text features could be further incorporated into the deep learning pipeline. However, despite research progress in academia, lots of problems still stand in the way of applying deep learning in real life. For example, interpretability is crucial for multimodality applications in the medical domain, so we need to delve deeper in this area. Also, in mobile applications, imageand videocaptioning tasks require a tradeoff between speed and accuracy due to the graph search required for inferencing. We’ve still got a long way to go, but looking to the future, as deep learning bridges multiple modalities and we acquire more information, it should provide better AI services to users.
منابع مشابه
Effect of Multimedia Self-Care Education on Quality of Life in Burn Patients
BACKGROUND Burn injuries can have adverse effects on quality of life of patients and can disturb their physiological, psychological, social and spiritual well-being. This study aimed to investigate the effect of multimedia self-care program on quality of life in burn patients. METHODS This Randomized controlled clinical trial was conducted from November 2015 to December 2016. The samples w...
متن کاملThe effect of instructional multimedia on communicational skills learning of autism students
Background: This research aimed to estimate the effect of instructional multimedia on communicational skills learning of autism students at 3rd grade elementary school in Tehran. Methods: The method of this research was quasi- experimental and population was all the autism students at 3rd grade elementary school in Tehran.16 students were chosen by accessible sampling and they were randomly div...
متن کاملThe effect of communication skills training through multimedia on the self-esteem of female students with hearing loss
The current study aims to peruse the effectiveness of teaching communicative skills through multimedia on the self-esteem of girl students with hearing problem. The research methodology is of semi empirical type and statistical population encompassed the whole girl students with hearing problem at sixth grade in Tehran city. With sampling method, thirty students were selected and randomly were ...
متن کاملImpact of multimedia as a teaching tool on the performance of the 1st year MBBS students for academic achievements in India
Background: There are various methods of teaching adopted by teachers/professors in medical colleges to deliver the lectures to the students. The present study aimed to find out the impact of multimedia as a teaching tool on the performance of the 1st year Bachelor of Medicine and Bachelor of Surgery (MBBS) students for academic achievements. Methods:</...
متن کاملComparing the Effect of Multimedia and Practical Education on the Oral Hygiene of Orthodontic Patients
Introduction: Brackets and other fixed orthodontic appliances not only make tooth brushing more difficult, but also provide a suitable environment for the accumulation of plaque. To prevent this situation, dentists usually educate their patients to control the plaque formation and maintain good oral hygiene. This study aims to compare the effect of multimedia and practical education on the know...
متن کاملA Fuzzy Based Approach for Rate Control in Wireless Multimedia Sensor Networks
Wireless Multimedia Sensor Networks (WMSNs) undergo congestion when a link (or a node) becomes overpopulated in terms of incoming packets. In WMSNs this happens especially in upstream nodes where all incoming packets meet and directed to the sink node. Congestion in networks, if not handled properly, might lead to congestion collapse which deteriorates the quality of service (QoS). Therefore, i...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- IEEE MultiMedia
دوره 24 شماره
صفحات -
تاریخ انتشار 2017